Speed-up via Quantum Sampling 
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The Markov Chain Monte Carlo method is at the heart of efficient approximation schemes for a 
wide range of problems in combinatorial enumeration and statistical physics. It is therefore very 
natural and important to determine whether quantum computers can speed-up classical mixing 
processes based on Markov chains. To this end, we present a new quantum algorithm, making 
it possible to prepare a quantum sample, i.e., a coherent version of the stationary distribution of 
a reversible Markov chain. Our algorithm has a significantly better running time than that of a 
previous algorithm based on adiabatic state generation. We also show that our methods provide a 
speed-up over a recently proposed method for obtaining ground states of (classical) Hamiltonians. 

PACS numbers: 03.67.Ac, 87.10.Rt, 87.55.de 



I. INTRODUCTION 

Randomization plays a crucial role in the design of ef- 
ficient algorithms for computing approximate solutions 
to problems in combinatorial enumeration and statisti- 
cal physics that are known to be #P-complete. Impor- 
tant examples are randomized polynomial-time approxi- 
mation schemes for evaluating the permanent of a non- 
negative matrix [1], the volume of a convex polytope [2], 
and the partition functions of the monomer-dimer and 
ferromagnetic Ising systems 0, 3- The centerpiece of 
all these algorithms is the Markov Chain Monte Carlo 
(MCMC) method, making it possible to approximately 
sample from a particular probability distribution tt over 
a large set D,. 

In the MCMC method, one constructs a sparse, ergodic 
Markov chain (stochastic matrix) P on the state space fi 
such that its stationary distribution tt is the desired prob- 
ability distribution and then, starting from some initial 
state X, repeatedly applies P so that the resulting proba- 
bility distribution over Q, after r steps is sufficiently close 
to TT. The required number of step r is referred to as 
mixing time. Bounding the mixing time of the Markov 
chain is often the major technical hurdle in proving the 
running time of the overall algorithm. This problem can 
be reduced to estimating the spectral gap 5 of P as de- 
scribed in the following. 

We refer the reader to p, for more details on 
Markov chains. Let P be an ergodic (i.e., irreducible 
and aperiodic) reversible Markov chain with finite state 
space J7 and stationary distribution tt = (7r(a;)) „. Let 

P*-*' (x, y) denote the t-step transition probability from x 
to y and P'^'^^x, •) the probability distribution if we start 
in X and apply the Markov chain t times. To determine 
how fast the stationary distribution is approached from 
some initial probability distribution, we have to look at 
the spectral properties of the transition matrix P. 



It follows by the Perron-Frobenius theorem that sta- 
tionary distribution tt is the unique (left) eigenvector of 
P, i.e., TT^P = TT^, with associated eigenvalue Aq = 1. 
Let {Xj : 1 < j < N — 1} , Xj G M., denote the remain- 
ing eigenvalues (not necessary distinct), where N — |ri|. 
It also follows that these eigenvalues satisfy \Xj\ < 1 for 
1 ^ j < N — 1. Let us order the eigenvalues such that 



l = Ao>|Ai|> |A2|> 



> 



|A 



N-l 



>0. 



Let S denote the spectral gap of P, i.e., 5 = 1 — |Ai|. 
The rate of convergence to the stationary distribution is 
governed by the spectral gap 0, page 61]. 
The variation distance from initial state x is 



^'*^(^) = jEi^^*^(^'y) 



""(y)! 



yen 



For ee (0,1), let 



T^{x) = min{i : d'^^'^x) < e for aU t' > t} 
Tf — max Tf (x) . 

These quantities satisfy the following inequalities: 



1 

26 



log(2e)"i < T, 

Te{x) < -(^log(7r(a;))^^ -|-loge~^ 
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Given the fact the MCMC method is at the heart of so 
many efficient classical algorithms, it is very natural and 
important to determine to what extent quantum comput- 
ers can speed-up classical mixing. It possible to prepare 
the quantum sample Jtt), i.e., a coherent version of the 
stationary distribution where the amplitudes are given 
by the square roots of theprobabilities of the states by 
quantized Markov chains [7|, |8| . The running time of this 
algorithm is 0{l/y^dTT{x)). Basically, this algorithm is 
Grover search where marking of the target state is the re- 
flection around the quantum sample JTr). This reflection 
can be realized with cost 0{1/VS). 



Unfortunately, this running time is too high for apph- 
cations where the state space fl is exponentiahy large and 
tt{x) can be exponentially small. The question whether a 
quantum speed-up to 0{1/VS log(l/7r(a;)) is possible has 
been examined in [9|. The author proposes a method 
based on quantum walks that decohere under repeated 
randomized measurements to attack this problem. He 
shows that this speed-up is indeed achievable for the de- 
coherent quantum walk on a periodic lattice Z^. How- 
ever, the question whether this speed-up is achievable 
for arbitrary Markov chains remains an important open 
problem. 

In this paper, we propose a different method for 
speeding-up classical mixing processes. We show how 
to efficiently prepare the quantum sample JTr) pro- 
vided that we have a sequence of slowly-varying Markov 
chains in the following sense: (i) there are Markov 
chains Pq, Pi, . . . , Pr = P with stationary distributions 
ttq, TT, . . . , TTr such that distributions of adjacent Markov 
chains are sufficiently close and (ii) the quantum sam- 
ple Ittq) can be prepared efficiently. The idea of quantum 
state generation based slowly- varying Markov chains was 
proposed in [lO| , where adiabatic techniques were used to 
create the quantum samples Itt^) sequentially. The result- 
ing running time is Q{l/S), i.e., it does not provided the 
speed-up that would reduce S to y/S. 

We improve the running time by (a) using Szegedy's 
quantum walk operators instead of Hamiltonians derived 
from the Markov chains and (b) preparing the interme- 
diate quantum samples by amplitude amplification. The 
resulting running time is worse than Richter's conjec- 
tured running time 0(1/\M log(l/7r(a;)). However, in 
some situations, our running time is better than the clas- 
sical lower bound on the mixing time. 

We then apply our method to the special case of sim- 
ulated annealing. We obtain a better running time than 
that of a recently proposed method for this purpose 

mm- 

Their method makes use of Szegedy's quantum walk 
operators and the quantum Zeno effect. It yields a 
quadratic speed-up with respect to the spectral gap and 
provides an overall speed-up with respect to the classi- 
cal algorithm for simulated annealing. However, if one 
app lied their method to the general case considered in 
[10|, one would also obtain the quadratic speed-up with 
respect to the spectral gap but not the overall speed-up. 
The problem is that the quantum Zeno effect would result 
in an exponential slow-down in the general case. [17[ This 
is avoided by amplitude amplification in our approach, 
which can lead to an overall speed-up in the general case. 

The paper is organized as follows. In Section 2 we 
present a modified version of Szegedy's quantum ana- 
log of an ergodic reversible Markov chain whose unique 
eigenvector with eigenvalue 1 (on the relevant computa- 
tional subspace) is the quantum sample of the station- 
ary distribution. We also describe its spectral proper- 
ties in detail. In Section 3 we present a primitive for 
preparing quantum states based on amplitude amplifica- 



tion and also a primitive for implementing approximately 
phase gates that are needed for amplitude amplification. 
In Section 4 we use the primitives to obtain a quantum 
method for preparing the quantum sample of an arbi- 
trary reversible Markov chain provided that we have a 
sequence of slowly- varying Markov chains. In Section 5 
we show that our approach makes it possible to prepare 
quantum samples of Boltzmann-Gibbs distributions more 
efficiently. 



II. QUANTUM ANALOG OF CLASSICAL 
ERGODIC REVERSIBLE MARKOV CHAINS 

We refer the reader to [3, Q for more details on the 
quantization. Let TC = C^ ^ C^. The basis states of Ti 
are denoted by \xy) for x, y G 17. For a; G i7, define the 
normalized vectors 



\Px 



where p^y denotes the transition probability from x to y. 
A quantum update is any unitary U that satisfies 

U\x)\0) ^ \x)\p,) 

for some fixed state G fi and all x G ft. We refer to 
the cost to realize U and its inverse W as the quantum 
update cost. 

To construct the quantum walk, we define the sub- 
spaces 

A = span{|a::)|0) : x £ fl} 
B = U'^SUA, 

where S denotes that the unitary operator swapping the 
two tensor components of Ti. For K. = A,B, denote by 
Hyc the orthogonal projection onto K, and by 

i?K = 2Hk - / . 

the reflection around /C. 

Definition 1 (Quantum walk) The quantum walk 
W{P) based on the classical reversible Markov chain P 
is defined to be the unitary operation (rotation) 



W{P) ^Rb-Ra 



(1) 



The quantum walk W{P) can be realized by applying 
both U and W twice: 

W{P) ^U^ -S-U ■RA-U'f -S-U -Ra- 

Our definition of W{P) is equal to that in [ll|. This is 
different from the definition used in [3, Q 

W{P) = Rs-Ra 



where 



III. PRIMITIVES 



A = span{\x)\px) 
B = spa,-n{\px)\x) 



xen} = UA 
xen} = UB. 



Since W{P) and W{P) are equal up to conjugation by U, 
we can apply the spectral analysis from [3] to determine 
the spectrum of W{P). We refer to the subspace A + B 
as the busy subspace and to its orthogonal complement, 
i.e., A-^ (IB^, as the idle subspace. Clearly, the operator 
W{P) acts as identity on the idle subspace. On the busy 
subspace, the spectrum of W{P) is as follows. 

Theorem 1 Let P be a time-reversible Markov chain. 
Let 9i,...,6m e (0, f) be such that |Ai| = 
cos{9i), . . . ,\Xm\ — cos6'j\/ where M < N ~ 1 
and the remaining eigenvalues are equal to 0, i.e., 
AAf+i, • • ■ , Ajv-i = 0. 

L On AnB the operator W{P) acts as the identity I . 
This subspace is one dimensional and is spanned by 
the eigenvector |7r)|0) where 



is the quantum sample of the stationary distribution 

7T ofP. 

2. On Af] B^ and A^ f] B the operator W{P) acts 
as —I. The dimensions ofAnB-^ and A^ H B are 
equal to N—l — M, i.e., the dimension of the kernel 
of P. 

3. OnA+B those eigenvalues ofW{P) that have non- 
zero imaginary part are exactly e ^*^^ , ■ ■ • , e ^*^*^ 
with the same multiplicity. 

4- W{P) has no other eigenvalues on A + B. 

Proof: This follows from [^, Theorem 4] and the results 
in 0, Section 12]. 

In abuse of notation, we often use JTr) instead of |7r)|0). 
In the following, we always stay in the busy subspace 
A -\- B. This is important because we want to obtain 
the unique eigenvector Jtt) and not any other eigenvector 
with eigenvalue 1 contained in the idle subspace. 

The phase gap of A(P) of W{P) is defined to be 26*1. 
This is motivated by the above theorem since the angular 
distance of 1 from any other eigenvalue (corresponding 
to an eigenvector in the busy subspace) is at least A(P). 
The phase gap satisfies 



A(P) > |1 - e^''^'\ = 2y^l - A2 > 2y^6(P). 

This inequality is at the heart of the quadratic speed-up 
due to quantum walks. 



A. Preparation via amplitude amplification 

We use Grover's ^-amplitude amplification (fixed 
point search) , making it possible to drive a source state to 
the desired target state by applying a sequence of phase 
gates [l^ . It is a special case of the general approach to 
amplitude amplification based on the phase matrix that 
was introduced in [14| . 

Lemma 1 Let \ti) and |ti+i) be two arbitrary quantum 
states in C^ with \ {ti \ ti+ 1 ) p > p for some p with < p < 
1. Denote by Hi the projection on the subspace spanned 
by \ti) and by 11^ j^ the projection onto the orthogonal 
subspace. Let lo — e'a*. Define the unitaries 

Ri + l = LUl\-i+l + 11,; + ;^ . 

Define the unitaries Ui-m recursively as follows: 

C/i;0 = / 



u. 



i-.m+l 



Ui-m ■ Ri ■ Uj ■ Ri+l ■ Ui-r^ 



Then, at the mth level of recursion we have 

\{t^+l\U^,„^\t^)\^ > I - {1 - pf"^ . 

The unitaries in {Ri,Rl,Ri+i,RJ,^} are used at most 
3™ times. 

Note that the running time of the algorithm is worse than 
that of Grover's algorithm (the latter is also a special case 
of amplitude amplification based on the phase matrix). 
The reason why we cannot use Grover's algorithm is as 
follows. Since we only have a lower bound on the overlap 
between \ti) and |ti+i), we do not know how many Grover 
iterations we have to apply without overshooting. There- 
fore, we have to employ the version of Grover's algorithm 
considered in [l5|, making it possible to obtain the tar- 
get state even if the overlap is not known. However, the 
problem is that for this algorithm we have to prepare 
the initial state several times. This prevents us from us- 
ing Grover's algorithm because it is absolutely necessary 
for our primitive discussed below that the initial state is 
prepared only once. 

Corollary 1 Let \to) , . . . , \tr) be arbitrary quantum 
states in C^ with \{ti\ti+i)\'^ > p for i = 0, ... ,r — 1. 
Given the state {to), we can prepare a state \tr) such that 

\\\ir) - \tr)\\ <ei, 

for any ei > 0, by invoking the unitaries from {RijRJ : 
i = 0, . . . ,r} no more than 



L- 



12rlog(2r/ei) 
log (1/(1 -p)) 



times. 



Proof: Set g = 1 - p and M = 3™. For i = 0, . . . , r - 1, 
define l^^^i) = CA;m|^i)- K follows from Lemma 1 that 
Wi+i) can be expressed as 

\t^+l) ^ a\U+i) + P\ti,i) 

where a, (3 are two probability amplitudes with |a| > 

is some state with 



(^i+il^i+i) = 0- Consequently, we have 



V^, and \ti,,) 



\\tUi) - \U+i)\\ < 1 - v/r^^+ vV^ < 2vV^. 



For i — 0, 



1, define 



l^~ 



i+U 



J=0 



The task is now to show how to choose m so that 

|||tr>-|tr)|| <ei. 

To do this, we use induction. The base step is 

|||ti)-|ti)|| <2V?^. 
The inductive step is 

|||ir)-|tr)|| 
= |||ir) — Ur-l;m\tr-l) + Ur-l;m\tr-l) — \tr)\\ 
^ ||t^r-l;m|| ' || l^r-l) ^ l^r-l) || + || l^r) ^ l^r) || 
< \\\ir-l) - \tr-l)\\ + 2^/^ . 

We obtain 

\\\ir) - |tr)|| < 2r^/q^ . 

To make the norm distance less or equal to ei, it always 
suffices to choose M to be the smallest power of 3 satis- 
fying 



M > 



21og(2r/£i) 
log (1/(1 -p)) 



It follows that the unitaries from the set {Ri,Rl : i ~ 
0, . . . , r + 1} are used at most 2rM times. This number 
is bounded from above by 



L 



12rlog(2r/ei) 
log (1/(1 -p)) 



B. Approximate phase gates 

In this section we consider the case where the states 
jto), l^i), ■ • • , l^r) in Corollary 1 are quantum samples of 
stationary distributions. We show how to approximately 
implement the required phase transformation using quan- 
tum walks and a variant of the phase estimation algo- 
rithm. 



Lemma 2 Let W he a unitary acting on C' with unique 
eigenvector |V'o) with eigenvalue Aq = 1. Denote the re- 
maining eigenvectors and eigenvalues of W by {ipj) and 
\j = e^'^^fi for j = 1, . . . ,d — 1, respectively. Let 



A= min \(pj\ 

j = l,...,d-l 



be the phase gap of W . Let 



= [log(l/A)] 
= [log(l/V^)] 



for some £2 > 0- Then, there is a quantum circuit V 
acting on C''(8'(C^) that invokes the controlled-W gate 
at most 2" • c times and has the following properties 



where \xj) are some unit vectors in (C^) with 

(0---0|x,) = 0/orj-l,...,rf-l. 

Proof: First, we apply the phase estimation circuit U 
with a ancilla qubits as depicted below. The circuit U 
invokes the controlled-T4^ gate 2° — 1 times. 



|0> —H 



in^ TT 












|u; 11 






|u; ii 












1,;, \ 


W^" 




TIA^l 




IVj/ 


w 








n"' 



\®a 



We have 

C/|^,)|0...0)' 

= |^,.)«DFTt(^'^' 



e^'^'""^^- Im) 



2"-! 



I^j)®^ ^ g2^*mV, g-2™nm'/2»|^/^ 



The amplitude a^' of the state \m') is 

1 ^ ~ "^ , 1 1 „27ri(2"cpj-m') 

±_ V^ 27ri(ipj-m72'')m lie 



2a 1 p27ri((/?j— 7n'/2^) 



Observe that for j = 



1 if m' = 
if m' 7^ 



and so 



C/|7^o)|0)«'^ = |V^o)|0)^'^. 



Now consider the case j ^ 0. To bound |ao|, we use the 
inequaUty |1 — e"| > 2|a;|/7r whenever — tt > x > tt. We 
obtain 



l"0| = TTT 



< 



Let 



< 



< 



< 



1 


1 


_ g27ri2"ipj 


2" 
1 


] 


_ g27riVj 
1 


2a— 1 

1 




1 - e27riipj 
TT 


2"-i 
1 




2.27r|^,| 


2<^+i|^,| 
1 



2"+iA 
1 

2 ■ 



We conclude that for j y^ we have 

C/|^,)|0)«'^ = a|^,)|x,>+c.o|V',>|0)^^ 

where {0---0\xj) == 0, |a| > V3/2, and |ao| < 1/2. 

Since we are only interested in the amplitude of the 
state jo)®" on the ancilla qubits, wc can replace the in- 
verse discrete Fourier transform DFT' in the phase esti- 
mation circuit by the Walsh-Hadamard transform iJ®". 
This is seen as follows. Observe that both H'^"- and DFT 
create the uniformly weighted superposition of all com- 
putational basis states when applied to jO)®", implying 
that 

(O---O|DFTt|0) = (O---O|i/®"|0) 

for an arbitrary state \(j)) on the ancilla register. 

Second, we reduce the "error amplitude" to y/e2 by 
applying the circuit U c times, using a new block of a 
ancillas each time. Let V be the resulting circuit. V 
invokes the controlled- VF gate (2" — l)c times. For j ^ 0, 
we have 



TM,)W 



as desired. For j = 0, we have 

T/|V'o)|0)®"^ = |Vo|( 
This completes the proof. 



Corollary 2 Let W he a unitary acting on C^ with 
unique eigenvector jf/'o) with eigenvalue Ao = 1. Denote 
the remaining eigenvectors and eigenvalues ofW by lipj) 
and Xj — g'^'^^Vj Jqj- j — l,...,rf— 1^ respectively. Let 



A 



] = 1,...A-1 



be the phase gap of W . Let 11 he the projector onto the 
space spanned by jf/'o) and li^ the projector onto the or- 
thogonal complement. Let R he the unitary that acts on 
C^ as follows 



a 
c 



[log(l/A)] 
[log(l/Vi;)l 



for some 62 > 0. Then, there is a quantum circuit R 
acting on C''(8)(C^)®"'^ that invokes the controlled-W gate 
2""*"^ • c times and has the following properties: for j — 0, 

i?|^o)|0)«-=(i?|^o))|0)^'^^ 

and for j 7^ 0, 

i?|^,)|0)«-=(i?|^,))|0)«-+|^), 

where |^) is some error vector in C* (E) (C^)'^"'^ with 

III0II<2V^. 

Proof: Let 

R^V^ ■ {Id® Q) ■ V, 
where Q is the following phase gate 



that acts on the ancilla register. 
For j ~ 0, it clear that 

i?|^o)|0)®"^ - (i?|^o»|0)®"^ -c.|^,)|0) 



®ac 



Let us now analyze the action of R for j ^ 0. The state 
after the application of V is 



where (xjlO)**"^ = 0. The state after the application of 

/d (g) Q is 



IV',) ® (yr^^ix,) + v^2\or^') + 1^') , 



where 



IO = V^(^-i)l^,)|o)«'^= 



R 



M + W 



We have |||^')ll ^ '^\f^- In the final step, the application 
of V'^ leads to the state 

|V;,)|0)®- + T/t|O = |V^,>|0)®'^^ + |0 
with \\\i)\\ < 2^. We conclude 

i?l^^.)|o)®- = l^,)|o)®- + IO = (i?|^,))|o)«- + IO. 

It follows from Lemma 2 that V and V^ invoke the 
controUed-iy gate (2" — l)c times. So, R invokes the 
controUed-Ty gate 2(2" - l)c < 2°+^ • c times. 



IV. QUANTUM SAMPLING 

We now use Corollary 1 and Corollary 2 to prove the 
following theorem. 

Theorem 2 Let Po,Pi, . . . ,Pr be classical Markov 
chains with stationary distributions ttq, tti, . . . , tt^ and 
spectral gaps So, Si, ... , Sr, respectively. Assume the sta- 
tionary distributions of adjacent Markov chains are close 
to each other in the sense that their stationary distribu- 
tions TTi and TTi+i are close with respect to fidelity, i.e., 

fori^O,...,r-l, 

min{(5i : i = 0, . . . ,r} > S , 

and we can prepare the quantum sample Ittq). 

Then, for any e > 0, there is a quantum sampling algo- 
rithm, making it possible to sample according to a prob- 
ability distribution jfr that is close to iTr with respect to 
the total variation distance, i.e., D{frr,TTr) < £• 

The algorithm invokes the controlled-Wi operators at 
most 2°'~^^ ■ c ■ L times where 



L = 



12rlog(8r/e) 
log (1/(1 -p)) 

log(l/A)" 

/ 96rlog(8r/e) 
°H, log (1/(1 -p)) 



Proof: Corollary 1 shows that, given the initial state 
JTTo), we can prepare a state \TTr) with Wlnr) — \TTr)\\ < ei 
by invoking the unitaries from the set {Ri,R\ : i — 
0, . . . , r — 1} no more than 



L 



12rlog(2r/ei) 
log (1/(1 -p)) 



times. 

In Corollary 1 we assumed that we can implement 
these exactly. However, in reality we can only imple- 
ment the operators Ri and their inverses R\ as described 
in Corollary 2. This approximation adds an error vector 
1^) every time an operator Ri or i?J is applied, where 
111^)11 < 27iJ for some £2 > 0. 

Let IV') be the state obtained by implementing Corol- 
lary 2 using Ri to approximate Ri. Then, since these 
operators or their inverses are invoked no more than L 
times we have 

where c' — [log(l/Y^)] and |^) is some vector with 

IIIOII<2iV^. 



7i is the Hilbert space that our quantum samples live in 
and A is the Hilbert space of the ancilla qubits that are 
required to implement the approximate phase gates Ri 
and their inverses. 
Let 



be the ideal state. We choose ei 
eV(64L2) so that 



e/4 and £2 



iii^)-iv^>ii < iii^)-i^.>io>^°i + iiMio)®-- 1^)11 

- e/4 + e/4 
= e/2. 

For each x € il, we define the projector 

acting onli,® A. Let 

Let Vt' = riU{0}. Observe that the desired distribution 
■K is equal to the probability distribution given by 

^(x) = iiA,iv)ir 

Our protocol yields the probability distribution 

^(x) = ||A,|^)||2. 

We now bound the total variation distance between tt 
and TT from above. For a subset S C 51' let 



A, 



xes 



We have 



^(vr,^) 



max Wis) — tt(S)\ 
sen' I ' ' ' ' I 



= max 

sen' 




ksH) '- 


Asl^) ' 


< 2 max 

sen' 


Asl^) - 


- ks\i^) 


< 2 IV-)"!^) 




< e. 









It follows from Corollaries 1 and 2 that we invoke the 
controlled- VFi operators or their inverses at most 2°-^^-c-L 
times. 



V. QUANTUM SIMULATED ANNEALING 

The Metropolis algorithm refers to a general construc- 
tion that transforms any irreducible Markov chain on 
state space f2 to a time-reversible Markov chain with a 



required stationary distribution. We consider the case 
where the desired stationary distribution is equal to 
the Boltzmann-Gibbs distribution irp of some (classical) 
Hamiltonian H at (inverse) temperature f3. We show how 
to prepare the corresponding quantum sample Jtt^) using 
Theorem 2. The resulting algorithm has a better running 
time than that of the recently proposed algorithm based 
on the quantum Zeno effect [111 . Il2l | . 

For completeness, we give a short description of the 
Metropolis algorithm. The presentation is based on [la . 
Lemma 10.8]. For a finite state space fl and neighbor- 
hood structure {N{x) : x G fi}, let N — maxa;go |-^(2;)|- 
Let M be any number such that M > N. For all x G ft, 
let E{x) > be the energy of the state x. The desired 
stationary distribution is np = (tt^ (x) : x G fl) , where 



np{x) 



eyi-p{-E{x)P 



is the probability of state x and 



Z0 = J2 exp(-/3£;(x)) , 



denotes the partition function at temperature p. 

Consider the Markov chain Pfj whose transition prob- 
abilities pxy are: 

min|l,exp((^(y)-^(a;))/5)|/Af iix^y,ye N{x) 
ifx^y,y^ N{x) 

^-J2z^xP^^ if a; = 2/ 

Then, if this chain is irreducible and aperiodic, Pp is 
time-reversible and its stationary distribution is given by 
the Boltzmann-Gibbs probabilities TTp{x). Let S/s denote 
its spectral gap. 

Let H be the Hamiltonian defined by 

H = J2e{x)\x){x\. 
xen 

We need the following simple lemma that characterizes 
how the quantum sample changes when the temperature 
is increased. 

Lemma 3 The quantum samples of the stationary distri- 
butions of the above Metropolis process at temperatures j3 
and P + A/3 satisfy 

1(^/5 k/5+A/3)P>exp(- II 7J|| A/3) 

for all j3 and all A/3. 

Proof: We have 

(tt^Itt^+a/s) 

exp(-/3S(x)/2) exp(-(/3 + A/3)£;(x)/2) 



E 
E 

xeii 
> exp( 



> 



exp(-/3£:(x)) 

|i/||A/3/2). 



exp{-E{x)A(3/2) 



The first inequality follows from the fact that Zp > 
Zp+Af3- By taking the square, we obtain the desired re- 
sult. 

The following corollary follows directly from Theorem 2 
by observing that p = 1/e if we set A/3 = l/|li?|l in 
Lemma 3. 

Corollary 3 Letr^ P\\H\\, P^ = i/\\H\\fori = Q,...,r, 

S < mm{Si3. : i = 1, . . . ,r} 

be a lower bound on the smallest spectral gap and A the 
phase gap corresponding 6. Then, for any e > 0, there is 
a quantum algorithm that outputs the states x according 
to a probability distribution lifs^ with 

D{ttp,,tti3^) < e. 

The algorithm invokes the operators from {Wf3.} at most 



2a^ 



c-L 



times where 
L = 



12/3||g||log(8/3||g||/6) 

log(e/(e-l)) 
[log(l/A)] 

96/3||i7||log(8/3||i7||/e) 



log(- 



elog(e/(e-l)) 



Lemma 4 Let H be a Hamiltonian acting on a state 
space of cardinality d with spectral gap 7. Let H be 
the projector onto the eigensubspace corresponding to the 
minimal eigenvalue. Then 

|(7r^|n|7r^)|2>l-e3. 

provided that the inverse temperature satisfies 

■{l~e3)d^ 



/?>-log(- 

7 ^ 



£3 



(2) 



Proof: The worst case occurs when the ground state 
\g) is unique and all other states have energy 7 -I- E{g) 
where E{g) is the ground state energy. In this case the 
probability of obtaining the ground state when measuring 
Itt^) in the computational basis is 



^ {d - l)e-il^ + 1 ' 

To make this probability greater or equal to 1 — 63, it 
suffices to choose the inverse temperature /3 as in eq. ([2]). 

By choosing e = 1/4 in Corollary 3 and £3 = 1/4 in 
Lemma 4 we obtain the following corollary. 

Corollary 4 There is a quantum algorithm that outputs 
a ground state of H with probability greater than 1/2. It 
invokes the operators from {Wp- : i — I, . . . ,r} at most 



1 

A 



\H\\ 



\ogd ■ log 



\H\ 



logd 



(3) 



times. 



8 



Let us explain how the above algorithm differs from the 
algorithm based on the quantum Zcno effect p3l- That 
algorithm has the running time 



1 

A' 



m 
1 



log'd, 



(4) 



where A' is the phase gap corresponding to the minimal 
spectral gap in the sequence of Markov chains. 

Both algorithms make use of Szegedy's quantum walk 
operators to obtain a speed-up over the classical case 
due to the quadratic relation between phase gaps and 
spectral gaps. The reduction from (||i?||/7)^ in (|4]) to 
11^11/7 in © is due to the advantage of amplitude am- 
plification over the quantum Zeno effect. Note also that 
A' < A because the change in temperature between ad- 
jacent Markov chains is A/3' = 0{-i /{WHW^ log d)) in ^ 
and A/3 — 1/\\H\\ in our algorithm. Roughly speaking, 



amplitude amplification makes it possible to make "big- 
ger" jumps (i.e., bigger changes in temperature) than 
the quantum Zeno effect, without decreasing the success 
probability. 

Conclusions and Discussions. — We have presented a 
simple quantum algorithm, making it possible to prepare 
quantum samples of stationary distributions of arbitrary 
slowly-varying Markov chains. It significantly improves 
upon a previous algorithm for that purpose based on adi- 
abatic generation [lO|. It also provides a speed-up over 
a recently proposed method for preparing quantum sam- 
ples of Boltzmann-Gibbs distributions of classical Hamil- 
tonians |13 |. 
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